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ABSTRACT Colorectal cancer (CRC) has a connplex etiology resulting from the combination of multiple 
genetic and environmental factors, each with small effects. Interactions among susceptibility modifier loci 
make many of the loci difficult to detect in human genome-wide association studies. Previous analyses in 
mice have used classical inbred strains, which share large portions of their genomes due to common 
ancestry. Herein, we used an interspecific backcross between the Mus musculus strain A/J and the Mus 
spretus strain SPRET/EiJ to map 6 additional CRC modifier loci {Scc16-21) and 2 suggestive loci. Three loci 
modify the location of tumors along the proximal-distal axis of the colon. Six CRC modifiers previously 
mapped in intraspecific crosses were also replicated. This work confirms genetic models suggesting that 
CRC is caused by many small effect alleles and brings the catalog of reported CRC modifier loci to 23 
spread across 13 chromosomes. Furthermore, this work provides the foundation for large population-level 
epistatic interaction tests to identify combinations of low effect alleles that may have large effects on CRC 
susceptibility. 
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Colorectal cancer (CRC) is the third most commonly diagnosed form 
of cancer in developed countries, and it is responsible for the second 
largest number of cancer-related deaths. First-degree family members 
of an individual with sporadic cancer have a 2- to 3 -fold increased risk 
of developing CRC (Bishop and Thomas 1990; Stephenson et al. 
1991), indicating that shared genetic factors likely contribute to dif- 
ferential susceptibility. Twin studies demonstrate that heritable factors 
account for as much as 35% of sporadic CRC cases (Lichtenstein et al. 
2000). 

Genetic analyses in both humans and mice suggest that multiple 
small-effect alleles contribute to sporadic CRC susceptibility. Recent 
genome-wide association studies in humans have identified loci that 
increase risk associated with CRC (Broderick et al 2007; Houlston 
et al 2008; Jaeger et al 2008; Tomlinson et al 2007, 2008; Zanke et al 
2007). In an independent study, three of the loci were replicated, and 
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combined analysis suggested that the odds ratio is at most 2.6 for 
a high-risk patient with two risk alleles among the three loci (Tenesa 
et al 2008). Similar to the variability in susceptibility of humans to 
sporadic CRC, different mouse strains show varying susceptibility to 
colonic tumors (Bissahoyo et al 2005; Diwan and Blackman 1980). 

The CRC susceptibility loci identified in mice have primarily been 
based upon the azoxymethane (AOM) or related dimethylhydrozine 
(DMH) carcinogen model (Schauer et al 1969; Uronis and Threadgill 
2009; Ward 1975). AOM induces tumors in the distal mouse colon 
that resemble sporadic CRC of the descending colon in humans both 
histologically (Chang 1978; Uronis et al 2007) and molecularly (Hir- 
ose et al 2003; Kaiser et al 2007; Takahashi et al 2000; Vivona et al 
1993). Susceptibility to colon cancer 1 {Sccl) was one of the first low- 
penetrance cancer modifiers to be successfiilly mapped and cloned in 
mice using the AOM/DMH model (Ruivenkamp et al 2002). An 
additional 14 Sec loci have been mapped using the same recombinant 
congenic panel between BALB/cHeA and STS/A strains (Moen et al 
1992, 1996; Ruivenkamp et al 2003; van Wezel et al 1999), and more 
recently, one unnamed locus using genome-wide association in a panel 
of inbred mouse strains (Liu et al 2012). Two additional colon car- 
cinogenesis susceptibility loci, Ccsl and Cc52, were reported in studies 
using crosses between ICR/Ha and C57BL/6Ha and between CBA/J 
and C57BL/6J, respectively (Angel et al 2000; Jacoby et al 1994). 

A recent analysis of inbred mouse strains revealed that common 
laboratory strains have a significant, shared contribution from Mus 
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Figure 1 Genome scan for modifier 
loci controlling body weight. Signifi- 
cance thresholds are shown by dotted 
lines, and red lines mark peak LOD 
scores. 



musculus domesticus (Roberts et al. 2007; Yang et al. 2011). The 
consequence of common ancestry is that large genomic intervals are 
identical by descent, resulting in reduced genetic diversity. Conse- 
quently, previous studies in mice have analyzed only part of the mouse 
genome for CRC susceptibility loci. SPRET/EiJ (M. spretus) diverged 
from M. musculus over one million years ago and offers the potential 
to analyze additional regions of the mouse genome. Using genetic 
crosses between the AOM-resistant SPRET/EiJ strain and the suscep- 
tible A/J strain, we mapped six additional Sec loci and two suggestive 
loci influencing a variety of tumor phenotypes, including position 
along the proximal-distal axis of the colon. 

MATERIAL AND METHODS 
Genetic crosses 

A/J (A) and SPRET/EiJ (S) mice were obtained from the Jackson 
Laboratory (Bar Harbor, ME). Female A/J mice (M. musculus) were 
crossed to male SPRET/EiJ mice (M. spretus) to generate ASFl 
hybrids. Since this interspecific cross generates infertile males, ASFl 
females were backcrossed to A/J males to create a population of 235 
(ASFl)A N2 mice. 



Cancer phenotyping 

Mice, two- to- four months of age, were given four weekly intraperi- 
toneal injections of AOM at 10 mg/kg of body weight (Sigma- Aldrich, 
St. Louis, MO) as previously determined to optimize differential 
susceptibility to AOM-induced CRC (Bissahoyo et al 2005). Mice 
were weighed and killed by CO2 asphyxiation 20 weeks afi;er the last 
AOM dose. A tail clip, liver lobe, and the entire colon were removed 
from each mouse. Colons were gently flushed with phosphate buffer 
saline, mounted on bibulous paper, and splayed open along the lon- 
gitudinal axis. The number of tumors, their sizes, and locations along 
the proximal-distal axis were recorded. 

Genotyping 

DNA was extracted from liver or tail samples from each mouse using 
Puregene DNA Purification Kit (Promega, Madison, WI). Mice from 
the (ASFl)A N2 generation were genotyped using a custom Seque- 
nom MassARRAY SNP Genotyping platform containing 183 single 
nucleotide polymorphisms (SNP) markers (GeneSeek, Lincoln, NE). 
The custom platform design included SNP markers from NCBI Build 
37 spaced at 10-15 cM intervals selected to be informative between 




Figure 2 Colon tumor penetrance of parental strains, F1 hybrids, and 
N2 backcross mice used in the current study. 
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W Table 1 Point-wise analysis of previously mapped colon cancer susceptibility loci 



Locus 


blNJr IVIarker 


Chr 


IVIb 


Presence/Absence P-value 


Tumor Multiplicity P-value 


Tumor Load P-value 


Sec 7 


rs3716390 


2 


81.6 


0.55 


0.81 


0.41 


Scc2 


rs4223152 


2 


52.5 


0.64 


0.45 


0.72 


Scc3 


rs8253293 


1 


196.9 


0.28 


0.37 


0.13 


Scc4 


rs4231637 


17 


73.5 


0.13 


0.93 


0.78 


Scc5 


UNC_18_37970902 


18 


37.9 


0.66 


0.02 


0.02 


Scc6 


rs4228590 


11 


17.1 


0.74 


0.11 


0.15 


Scc7 1 Ccs2 


rs6322812 


3 


159.5 


0.08 


0.45 


0.73 


Scc8 


rs3701395 


8 


3.1 


0.25 


0.09 


0.02 


Scc9 


rs1 3480754 


10 


106.9 


0.05 


0.47 


0.70 


Sec 10 


rs1 6822005 


2 


91 


0.42 


0.85 


0.41 


Scc11 


rs3024208 


4 


130.4 


0.03 


0.09 


0.25 


Scc12 


rs1 6807506 


7 


135 


0.53 


0.69 


0.85 


Scc13 


rs29973570 


6 


72.5 


0.03 


0.03 


0.02 


Sec 14 


rs6280091 


10 


31.4 


0.54 


0.66 


0.87 


Seel 5 


rs4228762 


11 


58 


<0.01 


0.11 


0.18 


Cesi 


rs8261201 


12 


88.5 


0.24 


0.12 


0.35 


P-values lower than 0.05 are marked in bold. 



the A/J and SPRET/EiJ mouse strains. All genotype and phenotype 
data are provided in Supporting Information, File SI. 

QTL mapping 

Genotype probabilities were calculated using the Haldane map 
algorithm in J/qtl software (http://research.jax.org/faculty/churchill/ 
software/ Jqtl/index.html), a Java interface for R/qtl (Broman et al 
2003). As none of the tumor pheno types could be transformed to 
fit a Gaussian distribution when all mice were included, the binary 
phenotype of presence or absence of tumors was used for the initial 



analysis. Binary analysis was performed by obtaining the maximum- 
likelihood estimates using the EM algorithm at 2 cM intervals 
throughout the genome (Broman 2003). Mice without tumors were 
excluded from further quantitative analyses. Genetic analysis of body 
weight was performed on nontransformed data using all mice. 

The phenotypic measurements tumor number, average tumor 
diameter, maximum tumor diameter, and tumor position along the 
proximal- distal colonic axis from mice with tumors (n = 113) were 
transformed using rank z-transformation parameters and analyzed as 
normally distributed phenotypes. One-dimensional genomes scans 
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Chromosome 

Figure 3 Genome scan for modifier loci controlling the binary tumor presence phenotype. Significance thresholds are shown by dotted lines, and 
red line marks peak LOD score. Inset shows allele effect plot of the peak marker on Chr 1 1 . 
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W Table 2 Summary of colon cancer modifier loci identified in an (ASF1)A N2 backcross 



Locus 


Phenotype 


CMP ^/l-^rU^r 

oiNir iviarKer 


v^nr 


IVID 


L^onT int ^civij 


lkju ocore 


P-value 


Scc^6 


Tumor presence 


rsl 6808928 


11 


116.4 


32.0-71.0 


3.83 


<0.01 


Sec 17 


Tumor number 


rsl 3479769 


8 


55.6 


12.0-70.8 


2.66 


0.04 


Chr 1 


Tumor number 


rs3658044 


1 


19.2 


0-20.0 


2.291 


0.11 


Scc18 


1 U 1 1 lUI lUdvJ 


1 b 1 oouoo / ^ 




88.7 


0 Sfi 7 

OO.^ vJO. / 


2.825 


0.04 


Sec 19 


Max tumor size x 


rsl 34821 18 


14 


30.6 


5.8-20.2 


3.389 


0.05 




sex/tumor load 














See20 


Rel distal position x 


rs8238935 


1 


58.1 


9.1-34.5 


4.067 


<0.01 




tumor number 


























Full / Additive 


Alpha 


Seel 7 X See21 


Tumor load 


rsl 3479769 X rsl 681 0780 


8 X 1 


55.6 X 34.5 




5.565 /4.581 


0.10 


Seel 8 X Seel 9 


Tumor load x sex 


rsl 6805672 x rsl 34821 18 


6x 14 


88.7 X 30.6 




6.236 / 6.190 


0.05 














Full / Interaction 


Alpha 


Chr 1 X Chr 10 


Rel distal position x 


rs3658044 x NCBL1 0_991 87828 


1 X 10 


19.2 X 99.1 




7.358 / 6.351 


0.10 



tumor number 



Physical location of SNP markers. 

1.5 LOD confidence intervals based on genetic analysis. 



were performed using the EM algorithm. Tumor load (tumor number 
X average tumor diameter) could not be transformed to fit a normal 
distribution and was analyzed using an extension of the Wilcoxon 
rank sum test for non-parametric interval mapping (Broman 2003; 
Kruglyak and Lander 1995). No secondary loci were detected in any of 
the one- dimensional genome scans when controlling for the primary 
locus that was detected in each analysis. Only loci that reached ge- 
nome-wide significance were given official designations. 

Two-dimensional analysis of tumor pheno types was performed 
using Haley-Knott regression algorithms to identify epistatic inter- 
actions. Significance thresholds were calculated by performing 1000 
permutations with intervals set at the 182 SNP markers used in the 
analysis. The additive and interaction models were used to determine 
the type of epistasis influencing phenotypic variance. 

Pointwdse Wilcoxon rank sum test was performed on SNP markers 
in close proximity to previously identified Sec loci (Moen et al 1992, 
1996; Ruivenkamp et al 2003; van Wezel et al 1999). 

RESULTS 

Validation of the A/J x SPRET/EiJ cross for 
modifier mapping 

To validate the utility of the interspecific backcross population for 
mapping modifiers of CRC susceptibility, modifier loci controlling 
body weights of (ASFl)A N2 mice were analyzed, and their locations 
were compared with previously identified obesity modifiers in SPRET/ 
EiJ crosses. Loci on Chr 7 (Mobl) and Chrl2 {Mob3)y originally 
reported in a (C57BL/6J x SPRET/EiJ) x C57BL/6J backcross (Fisler 
et al 1993; Warden et al 1995; Yi et al 2004), were replicated (Figure 
1). Additionally, two previously reported body weight modifiers on the 
X chromosome (Bwl and Bw3) were replicated. Bwl and Bw3 were 



previously reported in (A/J x SPRET/EiJ) x C57BL/6J and (C3H/He x 
SPRET/EiJ) X C57BL/6J crosses, respectively (Dragani et al 1995). 

Replication of previously mapped Sec loci 

When exposed to four weekly doses of AOM, 100% of A/J mice 
developed colonic tumors, whereas SPRET/EiJ mice were completely 
resistant (Figure 2). Genome-wide, SPRET/EiJ is dominant to the 
susceptible A/J strain, as less than 5% of ASFl hybrids develop 
tumors. Almost 50% of the (ASFl)A N2 backcross mice (113 of 
235) developed colorectal tumors in response to AOM exposure. 

To determine whether previously mapped colon tumor suscepti- 
bility loci also contribute to differential susceptibility in the (ASFl)A 
N2 population, point-wise analysis was performed using the SNP 
marker nearest to previously identified Sec and Ccs loci (Table 1) 
(Angel et al 2000; Jacoby et al 1994; Moen et al 1992, 1996; Rui- 
venkamp et al 2003; van Wezel et al 1996, 1999). Markers near Scc9, 
SccUy Sccl3y and Sccl5 showed significant association with tumor 
presence, whereas Scc5 and Sec 13 were associated with tumor number 
and tumor load. Scc8 was only associated with tumor load. No other 
Sec or Ccs loci showed an association with colon tumor phenotypes in 
the (ASFl)A N2 population, and none of the previously mapped loci 
reached significance when performing a genome-wide analysis, con- 
sistent with being low- effect susceptibility alleles. Since SNPs were not 
selected based on location of previous colon cancer susceptibility loci, 
some markers were more distant, resulting in reduced power to rep- 
licate previous results. 

SPRET/EiJ harbors a major colon cancer resistance locus 

Binary analysis of tumor presence (>1 tumor, n = 113 mice) and 
absence (0 tumors, n = 125 mice) detected a single modifier locus 
(LOD = 3.83; Figure 3). This locus, designated Sccl6 (Table 2), was 



M Table 3 Tumor phenotype correlation matrix 

1^ Tumor Load Max Tumor Diameter Avg Tumor Diameter Relative Position Tumor Number 



Tumor load 1.00 0.68 0.50 0.47 0.89 

Max tumor diameter 1.00 0.86 0.31 0.47 

Avg tumor diameter 1.00 0.17 0.23 

Relative position 1 .00 0.45 

Tumor number 1 .00 
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Figure 4 Genome scan for modifier 
loci controlling the tumor number 
phenotype. Significance thresholds 
are shown by dotted lines, and red 
line marks peak LOD score. 



Chromosome 



most strongly associated with rsl6808928 (Chr 11, 71 cM). A tumor 
effect plot revealed that SPRET/EiJ contributes a tumor-resistance 
allele at Sccl6 (Figure 3, inset); mice heterozygous at rsl6808928 have 
a significantly increased likelihood of being tumor free than mice 
homozygous for the A/J allele (t test, P < 0.0001). 

Colorectal cancer phenotypes are determined by 
oligogenic modifier loci 

Mice without tumors were excluded fi-om analysis of quantitative tumor 
phenotypes, including tumor number, tumor load, average tumor 
diameter, maximum tumor diameter, and tumor position along the 
colon, phenotypes that are weakly correlated within the (ASFl)A N2 
backcross population (Table 3). Only average tumor diameter and max- 
imum tumor diameter, as well as tumor number and tumor load have 
values greater than 0.7. The fact that average tumor size was not corre- 
lated with tumor load, which is the product of average tumor size and 
tumor number, suggests that tumor number is much more variable than 
average tumor diameter and is the major determinant of tumor load. 

Tumor number is the only phenotype that was not transformable 
into a normal distribution. Non-parametric methods detected a single 
modifier on Chr 8 named Sccl7 (rsl3479769, 30 cM; LOD = 2.66; 
Table 2), which explains 10.1% of the variance in tumor number (P = 
0.001). An effect plot revealed that mice heterozygous at the Sccl7 
locus have fewer tumors than mice homozygous for the A/J allele {t 
test, P = 0.0003), showing that the A/J allele increases tumor number. 
An additional putative tumor number modifier on Chr 1 (rs3658044) 
was suggestive; this locus potentially explains 7.6% of the phenotypic 
variance (P = 0.003). 

Analysis of tumor load also identified one modifier on Chr 6 
named Sccl8 (rsl6805672, 37 cM; LOD = 2.826; Table 2), which 
explains 11.4% of the variance (P < 0.0001). Interestingly, the 
SPRET/EiJ Sec 18 allele increases tumor load. No loci were detected 
that modified average or maximum tumor diameter. However, a sex- 
dependent modifier locus on Chr 14 named Sccl9 was detected for 
maximum tumor size (LOD = 3.421); Sccl9 explains less than 1% of 
the variance (P = 0.325). 



The colon has a proximal-distal axis with regional differences in cell 
composition and fianction. Modifiers influencing tumor position within 
the colon have not been reported. To identify whether modifiers 
influence tumor position, a genome scan was performed using 
measurements of tumor position along the proximal-distal axis of the 
colon. Although no significant associations were detected using tumor 
position alone, if tumor number were considered as a covariate, one 
modifier named Scc20 was identified on Chr 1 (rs8238935, 32 cM; LOD 
= 4.067; Figure 4). This locus is proximal to the suggestive tumor 
number modifier Chr 1 (rs3658044) with an overlapping confidence 
interval; however, it is likely to be a distinct locus because the cor- 
relation between tumor number and position is only 0.45 (Table 3). 
Mice homozygous for the A/J Scc20 allele are likely to have tumors that 
are positioned more distally than mice heterozygous at the Scc20 locus. 

Epistatic interactions among modifiers 

Two-dimensional modifier scans were performed on all tumor 
phenotypes to detect allele combinations that may have gone 
undetected using one- dimensional scans. Analysis of tumor load 
revealed one significant (a = 0.05) and one suggestive interaction (a = 
0.1; Figure 5 and Table 2). The suggestive additive interaction was 
detected between Chr 8 (Sccl7) and a locus on Chr 1 named Scc21 
(fiill LOD = 5.565, and additive LOD = 4.581). An effect plot of tumor 
load shows that the tumor load-increasing effect of the A/J allele at 
Scc21 is observed only when mice are also homozygous for the A/J 
allele at Sec 17 (Figure 5, inset). A larger study will be required to 
validate putative epistatic interactions between loci on Chrs 1 and 6, 
Chrs 6 and 8, and Chrs 6 and 18 that do not reach the suggestive 
threshold set for interactions (Figure 5), as well as for the detection of 
interactions previously reported such as between Sse7 and SeeS (van 
Wezel et al 1996). 

When sex was used as a covariate with tumor load, a significant 
interaction was detected between Seel8 and Seel9 (full LOD = 6.236, 
and additive LOD = 6.190, Table 2). 

A suggestive interaction was also detected for tumor position, but 
only if tumor number was used as a covariate (Table 2). The 
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interaction LOD between (rs3658044) and a locus on Chr 10 
(NCBI_10_99187828) is 7.358 (full-add). 

DISCUSSION 

Several loci influencing CRC susceptibility have been identified in 
crosses between resistant and susceptible mouse strains. Numerous 
studies using recombinant congenic strains led to the identification of 
15 Sec loci, with almost half being involved in two-way interactions 



(Moen et al 1992, 1996; Ruivenkamp et al 2003; van Wezel et al 
1996, 1999). Analyses of additional crosses were used to identify Ccsl 
and Ccs2 (Angel et al 2000; Jacoby et al 1994). We employed an 
interspecific backcross to replicate six of the previously reported loci 
and to identify six additional loci, Seel6-21 (Figure 6). This brings the 
number of reported colon cancer susceptibility loci in mice to 23 
spread across 13 chromosomes, with half of the known loci being 
detected in the interspecific backcross reported here. Similar to 
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previous observations, numerous interactions between modifiers were 
observed, suggesting that networks of interacting, low-penetrance 
alleles determine CRC susceptibility. The small effect of most modi- 
fiers highlights the difficulties involved in identifying allelic combina- 
tions that influence cancer susceptibility in humans and that likely 
contributed to several previous modifier loci not replicating in the 
current study. Lack of replication could also be due to genetic differ- 
ences among the strain combinations used and/or Type I errors. 

A recently reported genome-wide association study using an 
intrasubspecific panel of mouse strains also replicated some previously 
detected Sec and Ccs loci (Liu et al 2012), although only one, Scc8, was 
in common with the present study. The lack of concordance across the 
three approaches (intrasubspecific linkage, intrasubspecific association, 
and intersubspecific linkage) underscores the difficulties in detecting 
low-penetrance allele effects and the variation in loci detected that is 
dependent upon the genetic structure of the target populations. 

Genome- wide analysis of an interspecific backcross between SPRET/ 
EiJ and A/J reported here detected a locus {Sccl6) with a major effect 
controlling penetrance of AOM-induced tumors. Tumor load was most 
strongly influenced by Sccl8 located on Chr 6. Furthermore, Sccl7, 
modulating tumor number, and Scc21 were found to interact to affect 
tumor load. Tumor load is also regulated in a sex-dependent fashion by 
Sccl9y which was detected individually using maximum tumor diameter. 

Modifiers also appear to influence the relative position of tumors 
along the proximal-distal axis of the colon. Scc20 on Chr 1 and an 
interaction between suggestive loci on Chr 1 (rs3658044) and Chr 10 
(NCBI_10_99187828) modify spatial positioning of tumors. Chr 10 
(NCBI_10_99187828) is proximal to Scc9 with both loci being located 
in a region of conserved synteny with human Chr 12. Interestingly, 
several studies report hypermethylation, overexpression, and chromo- 
somal loss of genes associated with colon cancer on 12q (Nakao et al 
1998; Richter et al 2003; van Dieren et al 2006). The detection of Chr 
10 (NCBI_10_99187828) in the present study increases the possibility 
that one or both of these loci represent alleles whose orthologs may be 
associated with cancer susceptibility in humans. 

A number of genome-wide association studies have been con- 
ducted recently in humans, which revealed several common variants 
that function as low-penetrance susceptibility alleles. Although none 
of the mouse susceptibility loci identified here appear to be in 
conserved syntenic segments with those detected in the human 
studies, the mouse loci may function in similar pathways or may be 
detectable only when allele interactions are tested. Future identifica- 
tion of the underlying genes responsible for the Sec loci should reveal 
the relationship between mouse and human cancer susceptibility and 
how genetic modifiers influence susceptibility. 
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